An entropy-based measure of morphological information
نویسندگان
چکیده
Traditional approaches to morphology tend to treat inflectional systems not as unstructured sets of forms with shared stems or roots but as structured networks of elements. The interdependency of elements is, as Matthews (1991: 197) notes, ‘the basis of exemplary paradigms’ in the classical grammatical tradition. Although the exemplary patterns and leading forms of traditional descriptions bring out the structure of inflectional systems, traditional accounts are deficient – or at least incomplete – in a number of important respects. In particular, there is no method for measuring the implication structure of a set of forms or, no means of gauging the diagnostic value of specific forms within a set, and no generally accepted way even of identifying the leading forms of a system. The approach outlined in this talk proceeds from the observation that implicational structure involves a type of information, specifically information that forms within a set convey about other forms in that set. Information in this sense corresponds to reduction in uncertainty. The more informative a given form is about a set of forms, the less uncertainty there is about the other forms in the set. In inflectionally complex languages, a speaker who has not encountered all of the forms of a given item is faced with some amount of uncertainty in determining the unencountered forms. If the choice of each form were completely independent, the problem of deducing unencountered forms would reduce to the problem of learning the lexicon of an isolating language. However, in nearly all inflectional systems, there are at least some forms of an item that reduce uncertainty about the other forms of the item. Once these notions are construed in terms of uncertainty reduction, the problem of measuring implicational structure and diagnostic value is susceptible to well-established techniques of analysis. The uncertainty associated with the realization of a paradigm cell correlates with its entropy (Shannon 1948) and the entropy of a paradigm is the sum of the entropies of its cells. The implicational relation between a paradigm cell and a set of cells is modelled by conditional entropy, the amount of uncertainty about the realization of the set that remains once the realization of the cell is known. The diagnostic value of a paradigm cell correlates with the expected conditional entropy of the cell, the average uncertainty remains in the other cells once the realization of the cell is known.
منابع مشابه
A multi agent method for cell formation with uncertain situation, based on information theory
This paper assumes the cell formation problem as a distributed decision network. It proposes an approach based on application and extension of information theory concepts, in order to analyze informational complexity in an agent- based system, due to interdependence between agents. Based on this approach, new quantitative concepts and definitions are proposed in order to measure the amount of t...
متن کاملINFORMATION MEASURES BASED TOPSIS METHOD FOR MULTICRITERIA DECISION MAKING PROBLEM IN INTUITIONISTIC FUZZY ENVIRONMENT
In the fuzzy set theory, information measures play a paramount role in several areas such as decision making, pattern recognition etc. In this paper, similarity measure based on cosine function and entropy measures based on logarithmic function for IFSs are proposed. Comparisons of proposed similarity and entropy measures with the existing ones are listed. Numerical results limpidly betoken th...
متن کاملSHAPLEY FUNCTION BASED INTERVAL-VALUED INTUITIONISTIC FUZZY VIKOR TECHNIQUE FOR CORRELATIVE MULTI-CRITERIA DECISION MAKING PROBLEMS
Interval-valued intuitionistic fuzzy set (IVIFS) has developed to cope with the uncertainty of imprecise human thinking. In the present communication, new entropy and similarity measures for IVIFSs based on exponential function are presented and compared with the existing measures. Numerical results reveal that the proposed information measures attain the higher association with the existing me...
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملHESITANT FUZZY INFORMATION MEASURES DERIVED FROM T-NORMS AND S-NORMS
In this contribution, we first introduce the concept of metrical T-norm-based similarity measure for hesitant fuzzy sets (HFSs) {by using the concept of T-norm-based distance measure}. Then,the relationship of the proposed {metrical T-norm-based} similarity {measures} with the {other kind of information measure, called the metrical T-norm-based} entropy measure {is} discussed. The main feature ...
متن کاملTsallis Entropy and Conditional Tsallis Entropy of Fuzzy Partitions
The purpose of this study is to define the concepts of Tsallis entropy and conditional Tsallis entropy of fuzzy partitions and to obtain some results concerning this kind entropy. We show that the Tsallis entropy of fuzzy partitions has the subadditivity and concavity properties. We study this information measure under the refinement and zero mode subset relations. We check the chain rules for ...
متن کامل